text and data mining
X-Nav: Learning End-to-End Cross-Embodiment Navigation for Mobile Robots
Wang, Haitong, Tan, Aaron Hao, Fung, Angus, Nejat, Goldie
Existing navigation methods are primarily designed for specific robot embodiments, limiting their generalizability across diverse robot platforms. In this paper, we introduce X-Nav, a novel framework for end-to-end cross-embodiment navigation where a single unified policy can be deployed across various embodiments for both wheeled and quadrupedal robots. X-Nav consists of two learning stages: 1) multiple expert policies are trained using deep reinforcement learning with privileged observations on a wide range of randomly generated robot embodiments; and 2) a single general policy is distilled from the expert policies via navigation action chunking with transformer (Nav-ACT). The general policy directly maps visual and proprioceptive observations to low-level control commands, enabling generalization to novel robot embodiments. Simulated experiments demonstrated that X-Nav achieved zero-shot transfer to both unseen embodiments and photorealistic environments. A scalability study showed that the performance of X-Nav improves when trained with an increasing number of randomly generated embodiments. An ablation study confirmed the design choices of X-Nav. Furthermore, real-world experiments were conducted to validate the generalizability of X-Nav in real-world environments.
PLLuM: A Family of Polish Large Language Models
Kocoń, Jan, Piasecki, Maciej, Janz, Arkadiusz, Ferdinan, Teddy, Radliński, Łukasz, Koptyra, Bartłomiej, Oleksy, Marcin, Woźniak, Stanisław, Walkowiak, Paweł, Wojtasik, Konrad, Moska, Julia, Naskręt, Tomasz, Walkowiak, Bartosz, Gniewkowski, Mateusz, Szyc, Kamil, Motyka, Dawid, Banach, Dawid, Dalasiński, Jonatan, Rudnicka, Ewa, Alberski, Bartłomiej, Walkowiak, Tomasz, Szczęsny, Aleksander, Markiewicz, Maciej, Bernaś, Tomasz, Mazur, Hubert, Żyta, Kamil, Tykierko, Mateusz, Chodak, Grzegorz, Kajdanowicz, Tomasz, Kazienko, Przemysław, Karlińska, Agnieszka, Seweryn, Karolina, Kołos, Anna, Chrabąszcz, Maciej, Lorenc, Katarzyna, Krasnodębska, Aleksandra, Wilczek, Artur, Dziewulska, Katarzyna, Betscher, Paula, Cieślińska, Zofia, Kowol, Katarzyna, Mikoś, Daria, Trzciński, Maciej, Krutul, Dawid, Kozłowski, Marek, Dadas, Sławomir, Poświata, Rafał, Perełkiewicz, Michał, Grębowiec, Małgorzata, Kazuła, Maciej, Białas, Marcin, Roszko, Roman, Roszko, Danuta, Vaičenonienė, Jurgita, Utka, Andrius, Levchuk, Paweł, Kowalski, Paweł, Prawdzic-Jankowska, Irena, Ogrodniczuk, Maciej, Borys, Monika, Bulińska, Anna, Gumienna, Wiktoria, Kieraś, Witold, Komosińska, Dorota, Krasnowska-Kieraś, Katarzyna, Kobyliński, Łukasz, Lewandowska, Martyna, Łaziński, Marek, Łątkowski, Mikołaj, Mastalerz, Dawid, Milewicz, Beata, Mykowiecka, Agnieszka Anna, Peljak-Łapińska, Angelika, Penno, Sandra, Przybysz, Zuzanna, Rudolf, Michał, Rybak, Piotr, Saputa, Karolina, Tomaszewska, Aleksandra, Wawer, Aleksander, Woliński, Marcin, Wołoszyn, Joanna, Wróblewska, Alina, Żuk, Bartosz, Żarnecki, Filip, Kaczyński, Konrad, Cichosz, Anna, Deckert, Zuzanna, Garnys, Monika, Grabarczyk, Izabela, Janowski, Wojciech, Karasińska, Sylwia, Kujawiak, Aleksandra, Misztela, Piotr, Szymańska, Maria, Walkusz, Karolina, Siek, Igor, Kwiatkowski, Jakub, Pęzik, Piotr
Large Language Models (LLMs) play a central role in modern artificial intelligence, yet their development has been primarily focused on English, resulting in limited support for other languages. We present PLLuM (Polish Large Language Model), the largest open-source family of foundation models tailored specifically for the Polish language. Developed by a consortium of major Polish research institutions, PLLuM addresses the need for high-quality, transparent, and culturally relevant language models beyond the English-centric commercial landscape. We describe the development process, including the construction of a new 140-billion-token Polish text corpus for pre-training, a 77k custom instructions dataset, and a 100k preference optimization dataset. A key component is a Responsible AI framework that incorporates strict data governance and a hybrid module for output correction and safety filtering. We detail the models' architecture, training procedures, and alignment techniques for both base and instruction-tuned variants, and demonstrate their utility in a downstream task within public administration. By releasing these models publicly, PLLuM aims to foster open research and strengthen sovereign AI technologies in Poland.
Task-Oriented Semantic Communication in Large Multimodal Models-based Vehicle Networks
Du, Baoxia, Du, Hongyang, Niyato, Dusit, Li, Ruidong
Task-oriented semantic communication has emerged as a fundamental approach for enhancing performance in various communication scenarios. While recent advances in Generative Artificial Intelligence (GenAI), such as Large Language Models (LLMs), have been applied to semantic communication designs, the potential of Large Multimodal Models (LMMs) remains largely unexplored. In this paper, we investigate an LMM-based vehicle AI assistant using a Large Language and Vision Assistant (LLaVA) and propose a task-oriented semantic communication framework to facilitate efficient interaction between users and cloud servers. To reduce computational demands and shorten response time, we optimize LLaVA's image slicing to selectively focus on areas of utmost interest to users. Additionally, we assess the importance of image patches by combining objective and subjective user attention, adjusting energy usage for transmitting semantic information. This strategy optimizes resource utilization, ensuring precise transmission of critical information. We construct a Visual Question Answering (VQA) dataset for traffic scenarios to evaluate effectiveness. Experimental results show that our semantic communication framework significantly increases accuracy in answering questions under the same channel conditions, performing particularly well in environments with poor Signal-to-Noise Ratios (SNR). Accuracy can be improved by 13.4% at an SNR of 12dB and 33.1% at 10dB, respectively.
PythonPal: Enhancing Online Programming Education through Chatbot-Driven Personalized Feedback
The rise of online programming education has necessitated more effective, personalized interactions, a gap that PythonPal aims to fill through its innovative learning system integrated with a chatbot. This research delves into PythonPal's potential to enhance the online learning experience, especially in contexts with high student-to-teacher ratios where there is a need for personalized feedback. PythonPal's design, featuring modules for conversation, tutorials, and exercises, was evaluated through student interactions and feedback. Key findings reveal PythonPal's proficiency in syntax error recognition and user query comprehension, with its intent classification model showing high accuracy. The system's performance in error feedback, though varied, demonstrates both strengths and areas for enhancement. Student feedback indicated satisfactory query understanding and feedback accuracy but also pointed out the need for faster responses and improved interaction quality. PythonPal's deployment promises to significantly enhance online programming education by providing immediate, personalized feedback and interactive learning experiences, fostering a deeper understanding of programming concepts among students. These benefits mark a step forward in addressing the challenges of distance learning, making programming education more accessible and effective.
EU accused of leaving 'devastating' copyright loophole in AI Act
"What I do not understand is that we are supporting big tech instead of protecting European creative ideas and content." The EU's AI Act, which came into force last year, was already in the works when ChatGPT, an AI chatbot that can generate essays, jokes and job applications, burst into public consciousness in late 2022, becoming the fastest-growing consumer application in history. ChatGPT was developed by OpenAI, which is also behind the AI image generator Dall-E. He would like legislation to fill that gap, but said it would take years, after the European Commission's decision last week to withdraw the proposed AI Liability Act. "It might be getting very difficult.
The UK rolls back controversial plans to open up text and data mining regulations • TechCrunch
The U.K. Government is seemingly backtracking on plans that would have allowed text and data mining "for any purpose," plans designed to position the U.K. as a "global AI superpower." The news emerges following months of blowback from creative industries concerned about what impact the rules might have on protected works. Text and data mining, for the uninitiated, is an essential component of just about every AI application, allowing researchers and developers to leverage disparate datasets to train their algorithms. But gaining access to a sufficient amount of data is not a straight-forward endeavor, given that data is often owned by organizations or individuals that might not want third-parties to have access to their data. Or, they may only make said data available under a commercial license, making it prohibitively expensive to harness.
We are tearing up creative rights to feed a flawed Whitehall obsession with AI
There's no reason you should have ever heard of Simon Squibb, the "chief purpose officer of the Purposeful Project". Mr Squibb, who describes himself as an "Elon Musk wanna-be" in his Twitter profile, is one of those tirelessly energetic mid-life influencers who proliferate on the petri dish of LinkedIn. Displaying the sort of enthusiasm that Matt Hancock reserves for a Bushtucker Challenge, Mr Squibb is on a mission. "I want to fix the education system", he says. This fix entails removing something that many of us consider quite an important part of the education system: the learning part.
All Change (but Not Just Yet) When It Comes to AI and IP
Artificial Intelligence (AI) has the potential to transform many aspects of life and the UK government has recognized that it is important to review IP laws to ensure that they evolve and promote innovation in this fast-paced area of technology. That was the motivation behind a recent UKIPO consultation which reported earlier this week. With regards to patent protection for AI-devised inventions, the report concluded that no changes are required to UK patent law, at least for the time being. At present, despite claims of certain parties and the international court case relating to the DABUS system which its promotors sought to name as the inventor on patent applications in a number of countries, there is no evidence of AI currently having the capacity to invent. Rather, the general consensus from respondents was that AI technology cannot, at least at present, invent without human assistance.
UK to boost AI development by removing data mining hurdles – TechCrunch
The U.K. is planning to tweak an existing law to allow text and data mining "for any purpose," in a move that's designed to boost artificial intelligence (AI) development across the country. The announcement constitutes part of a broader strategy to "level up" AI and transform the U.K. into what it calls a "global AI superpower" -- and part of this will involve reassessing existing intellectual property (IP) laws. Following a two-month consultation period where stakeholders from across the industrial spectrum were asked for input, including rightsholders, academics, lawyers, trade organisations and businesses, the U.K.'s Intellectual Property Office (IPO) today published its response and confirmed what will (and won't) be changing moving forward. Text and data mining (TDM) is pivotal to the development of new AI applications, allowing researchers and businesses to copy and harness disparate datasets to train their algorithms. However, gaining access to enough relevant data has inherent challenges -- the data is often owned by third-parties that may only want to make data available under a commercial license, if they make it available at all.
Text and Data Mining of In-Copyright Works
Text and data mining (TDM) uses statistical analysis tools to extract new knowledge from large quantities of text or data for purposes by finding patterns, discovering relationships, and analyzing semantics. It is used in a wide variety of fields from biomedical research to digital humanities. Japan has enacted laws to allow TDM research copying. These lawsuits grew out of the Google Book Search Project (GBS). GBS is a corpus of millions of digital books to improve its search technologies that Google developed after making a deal with the University of Michigan in 2004 to scan all eight million books in its library's collections. In return, Michigan got back from Google digital copies of the books it scanned.